Evolving from Bioinformatics in-the-Small to Bioinformatics in-the-Large
نویسندگان
چکیده
We argue the significance of a fundamental shift in bioinformatics, from in-the-small to in-the-large. Adopting a large-scale perspective is a way to manage the problems endemic to the world of the small-constellations of incompatible tools for which the effort required to assemble an integrated system exceeds the perceived benefit of the integration. Where bioinformatics in-the-small is about data and tools, bioinformatics in-the-large is about metadata and dependencies. Dependencies represent the complexities of large-scale integration, including the requirements and assumptions governing the composition of tools. The popular make utility is a very effective system for defining and maintaining simple dependencies, and it offers a number of insights about the essence of bioinformatics in-the-large. Keeping an in-the-large perspective has been very useful to us in large bioinformatics projects. We give two fairly different examples, and extract lessons from them showing how it has helped. These examples both suggest the benefit of explicitly defining and managing knowledge flows and knowledge maps (which represent metadata regarding types, flows, and dependencies), and also suggest approaches for developing bioinformatics database systems. Generally, we argue that large-scale engineering principles can be successfully adapted from disciplines such as software engineering and data management, and that having an in-the-large perspective will be a key advantage in the next phase of bioinformatics development.
منابع مشابه
Bioinformatics to Biostochastics: Statistical Perspectives and Tasks Ahead
Bioinformatics is an emerging field of science emphasizing the application of mathematics, statistics, and informatics to study and analysis of very large molecular biological (mostly, genetic and genomic) systems (data sets). In a comparatively broader setup of large biological systems without necessarily having a predominant genetic undercurrent, and having genesis in biometry to biostatistic...
متن کاملDeciphering the functional role of hypothetical proteins from Chloroflexus aurantiacs J-10-f1 using bioinformatics approach
Chloroflexus aurantiacus J-10-f1 is an anoxygenic, photosynthetic, facultative autotrophic gram negative bacterium found from hot spring at a temperature range of 50-60°C. It can sustain itself in dark only if oxygen is available thereby exhibiting a dark orange color, however display a dark green color when grown in sunlight. Genome of the organism contains total of 3853 proteins out ...
متن کاملBIOINFORMATICS EVALUATION OF T.FOENUM ACTIVE COMPOUNDS IN SUPPRESSION OF Α-GLUCOSIDASE ENZYME
Background: Diabetes mellitus is a metabolic syndrome characterized by elevated blood glucose. The α-glucosidase enzymes that are found in the small intestine are responsible for the hydrolysis of carbohydrates. The aim of this study was to Bioinformatics evaluation of T.foenum active compounds in suppression of α-glucosidase enzyme. Methods: This study was a descriptive-analytical method. For...
متن کاملAn enzymatic and bioinformatics study of native cutinase bacteria
Cutin is a polymer that is constructed in plants by the condensation and oxidation of fatty acids and plays a key role in the protection of plants against pathogens. Cutinase is a hydrolase enzyme that breaks down the cutin. The purpose of this study was to extract cutin from red apples with oxalate buffer, cutinase enzyme activity assay in LB culture, and bioinformatic analysis. To attain thes...
متن کاملNovel Applications of Immuno-bioinformatics in Vaccine and Bio-product Developments at Research Institutes
There are many challenges in the field of public health sciences. Rational decisions are required in order to treat different diseases, gain knowledge and wealth regarding research, and produce biological or synthetic products. Various advances in the basic laboratory science, computer science, and the engineering of biological production processes can help solve the occurring problems. Bioinfo...
متن کاملComparative bioinformatics analysis of a wild diploid Gossypium with two cultivated allotetraploid species
Background: Gossypium thurberi is a wild diploid species that has been used to improve cultivated allotetraploid cotton. G. thurberi belongs to D genome, which is an important wild bio-source for the cotton breeding and genetic research. To a certain degree, chloroplast DNA sequence information are a versatile tool for species identification and phylogenetic implications in plants. Different ch...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Omics : a journal of integrative biology
دوره 7 1 شماره
صفحات -
تاریخ انتشار 2003